Assessing thesaurus-based annotations for semantic search applications
Authors
Abstract
Statistical methods for automated document indexing are becoming an alternative to the manual assignment of keywords. We argue that the quality of the thesaurus used as a basis for indexing is of crucial importance in automatic indexing, both in its ability to adequately cover the contents to be indexed and in its suitability for the specific indexing method used. We present an interactive tool for thesaurus evaluation that combines statistical measures with appropriate visualisation techniques to support the detection of potential problems in a thesaurus. We describe the methods used and show that the tool supports the detection and correction of errors, leading to a better indexing result.
Similar resources
Finnish National Ontologies for the Semantic Web - Towards a Content and Service Infrastructure
We present a national ontology development and service framework being developed in Finland in 2003-2007. The framework is based on a set of related core ontologies, most notably on a national upper ontology based on the commonly used Finnish General Thesaurus YSA maintained by the National Library of Finland. The framework implements three ontology services by a web-based system ONKI. Firstly,...
Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the growth of the Web, there are many challenges in establishing a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, but the problem ...
Semantic Notation and Retrieval in Art and Architecture Image Collections
In this paper, we analyze various methods used for semantic annotation and search in a collection of art and architecture images. We discuss the Art and Architecture Thesaurus, WordNet, ULAN and Iconclass ontology. Systems for searching and retrieving art and architecture image collections are presented. We explore whether the MPEG-7 descriptors are useful for art and architecture image annotations. ...
Creating a National Content and Service Infrastructure for the Finnish Semantic Web
We present a national ontology development and service framework being developed in Finland in 2003-2007. Our goal is to initiate and support collaborative ontology development processes of various expert groups now developing keyword thesauri. The framework is based on a set of related core ontologies, most notably on a national upper ontology based on the commonly used Finnish General Thesaur...
Enhancing Web Search with Heterogeneous Semantic Knowledge
This paper explores four kinds of semantic knowledge to improve keyword-based Web search, including thesauruses, categories, ontologies, and social annotations. These heterogeneous kinds of semantic knowledge represent the meanings of Web information, and thus can be used to improve search results with respect to semantic relevance. Currently, different semantic search paradigms have been developed for diffe...
Journal title:
- IJMSO
Volume 3
Publication date: 2008